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In the Claims: 

1. (Original) A method of indexing a database of documents, 
comprising: 

providing a vocabulary of n terms; 

indexing the database in the form of a non-negative n x m index matrix F, 

wherein: 

m is equal to the number of documents in the database; 
n is equal to the number of terms used to represent the database; and 
the value of each element vy of index matrix V is a function of the number of occurrences 
of the i th vocabulary term in the j th document; 

factoring out non-negative matrix factors T and D such that 

V**TD\ and 

wherein T is an n x r term matrix, D is an r x m document matrix, and r < 

nm/(n+m). 

2. (Original) The method of claim 1 further comprising deleting said 
index matrix V. 

3. (Original) The method of claim 2 further comprising deleting said 
term matrix T. 

4. (Original) The method of claim 1 wherein r is at least one order of 
magnitude smaller than n. 
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5. (Original) The method of claim 1 wherein r is from two to three 
orders of magnitude smaller than n. 

6. (Original) The method of claim 1 wherein entries of said document 
matrix D falling below a predetermined threshold value t are set to zero. 

7. (Original) The method of claim 2 wherein r is at least one order of 
magnitude smaller than n. 

8. (Original) The method of claim 2 wherein r is from two to three 
orders of magnitude smaller than n. 

9. (Original) The method of claim 2 wherein entries of said document 
matrix D falling below a predetermined threshold value t are set to zero. 

10. (Original) The method of claim 3 wherein r is at least one order of 
magnitude smaller than «. 

11. (Original) The method of claim 3 wherein r is from two to three 
orders of magnitude smaller than n. 
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12. (Original) The method of claim 3 wherein entries of said document 
matrix D falling below a predetermined threshold value / are set to zero. 

13. (Currently Amended) A method of indexing a database of documents, 
comprising: 

providing a vocabulary of n terms: 

indexing the database in the form of a non-negati ve n x m index matrix P\ 

wherein: 

m is equal to the number of documents in the database; 
n is equal to the number of terms used to represent the database; and 
the value of each element V j of index matrix V is a function of the number of occurrences 
of the i rh vocabulary term in the j th document; 

factoring out non-negative matrix factors rand D such that 

V * TP. Th e m e thod of claim 1 wherein said factoring out of non • n e gative matrix factors 
rand D further comprises: 

selecting a cost function and associated update rules from the group: 

F = t ZK iog(rz)),-(rz)),] 

cost function ,=l J = [ associated with 

update rules J V ,iJ , ' ,and v JiJ ' , 
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cost function 


1=1 7=1 


v„ 


- 1 associated with 


update rules 


r (td\ 


and 


7 (re), 

A , and 


cost function 1=1 associated with update rules 


'* and * * )* ;and 

iteratively calculating said update rules so as to converge said cost 
function toward a limit until the distance between V and TD is reduced to or beyond a 
desired value. 


14. (Original) A program storage device readable by machine, tangibly 

embodying a program of instructions executable by the machine to perform method steps 
for indexing a database of documents, said method steps comprising: 
providing a vocabulary of n terms; 

indexing the database in the form of a non-negative n x m index matrix V, 


wherein: 


m is equal to the number of documents in the database; 

n is equal to the number of terms used to represent the database; and 

the value of each element Vn of index matrix V is a function of the number 


of occurrences of the i* vocabulary term in the j* document; 
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factoring out non-negative matrix factors T and D such that 
V* TD\ and 

wherein T is an n x r term matrix, D is an r x m document matrix, and r < 

nm/(n+m). 


15. (Original) A database index, comprising: 

an r x m document matrix Z), such that 
V*TD 

wherein T is an n x r term matrix; 

V is a non-negative n x m index matrix , wherein each of its m columns 
represents an j th document having n entries containing the value of a function of the 
number of occurrences of a I th term appearing in said j th document; and 

wherein T and D are non-negative matrix factors of V and r < nml{n+m)\ 

and 

wherein each of the m columns of said document matrix D corresponds to 
saidj th document. 


16. (Original) A method of information retrieval, comprising: 

providing a query comprising a plurality of search terms; 
providing a vocabulary of n terms; 

performing a first pass retrieval through a first database representation and 
scoring m retrieved documents according to relevance to said query; 
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executing a second pass retrieval through a second database representation 
and scoring documents retrieved from said first pass retrieval so as to generate a final 
relevancy score for each document; and 

wherein said second database representation comprises anrxm document 
matrix Z), such that 

V*TD 

wherein T is an n x r term matrix; 

V is a non-negative n x m index matrix , wherein each of its m columns 
represents an j th document having n entries containing the value of a function of the 
number of occurrences of a I th term of said vocabulary appearing in said j th document; and 

wherein T and D are non-negative matrix factors of V and r < nml{n+m)\ 

and 

wherein each of the m columns of said document matrix D corresponds to 
said j th document. 

17. (Original) The method of claim 16 wherein said final relevancy score 
for any j th document is a function of said j th document s corresponding entry in said 
document matrix D and the corresponding entries in said document matrix D of the T 
top-scoring documents from said first pass retrieval. 

18. (Original) The method of claim 17 wherein said relevancy score 
function for said f 1 document is proportional to a sum of cosine distances between said j* 
document s corresponding entry in said document matrix D and each of said 
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corresponding entries in said document matrix D of the T top- scoring documents from 
said first pass retrieval. 

19. (Original) The method of claim 16 wherein r is at least one order of 
magnitude smaller than w. 

20. (Original) The method of claim 16 wherein r is from two to three 
orders of magnitude smaller than n. 

21. (Original) The method of claim 16 wherein entries of said document 
matrix D falling below a predetermined threshold value t are set to zero. 

22. (Original) A program storage device readable by machine, tangibly 
embodying a program of instructions executable by the machine to perform method steps 
for information retrieval, said method steps comprising: 

providing a query comprising a plurality of search terms; 
providing a vocabulary of n terms; 

performing a first pass retrieval through a first database representation and 
scoring m retrieved documents according to relevance to said query; 

executing a second pass retrieval through a second database representation 
and scoring documents retrieved from said first pass retrieval so as to generate a final 
relevancy score for each document; and 
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wherein said second database representation comprises anrx/n document 
matrix D, such that 

V*TD 

wherein T is an n x r term matrix; 

V is a non-negative n x m index matrix , wherein each of its m columns 
represents an j th document having n entries containing the value of a function of the 
number of occurrences of a I th term of said vocabulary appearing in said j 1 * 1 document; and 

wherein T and D are non-negative matrix factors of V and r < nm/(n+m)\ 

and 

wherein each of the m columns of said document matrix D corresponds to 
said j th document. 
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